Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
Мындай | 1822 | 73 | 1 | 73.0000 |
жээгинен | 3251 | 275 | 4 | 68.7500 |
нече | 2815 | 130 | 2 | 65.0000 |
Мында | 1458 | 58 | 1 | 58.0000 |
Көп | 729 | 43 | 1 | 43.0000 |
Негизинен | 632 | 36 | 1 | 36.0000 |
Эгерде | 883 | 33 | 1 | 33.0000 |
эми | 4672 | 171 | 6 | 28.5000 |
Негизги | 880 | 56 | 2 | 28.0000 |
кандай | 3236 | 181 | 8 | 22.6250 |
Москвадагы | 249 | 22 | 1 | 22.0000 |
Илимий | 309 | 21 | 1 | 21.0000 |
Бир | 1783 | 82 | 4 | 20.5000 |
Калкы | 645 | 20 | 1 | 20.0000 |
анткени | 565 | 20 | 1 | 20.0000 |
аттуу | 1950 | 136 | 7 | 19.4286 |
ара | 1711 | 134 | 7 | 19.1429 |
бирок | 2242 | 70 | 4 | 17.5000 |
Кен | 726 | 33 | 2 | 16.5000 |
алдынча | 955 | 48 | 3 | 16.0000 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
Муниципалитеттин | 5843 | 1 | 300 | 0.0033 |
Мэр | 1421 | 1 | 136 | 0.0074 |
турат | 5572 | 6 | 327 | 0.0183 |
колдонулат | 3502 | 3 | 154 | 0.0195 |
кирет | 3033 | 3 | 113 | 0.0265 |
кездешет | 1821 | 3 | 90 | 0.0333 |
алынат | 1548 | 3 | 85 | 0.0353 |
иштейт | 1353 | 3 | 78 | 0.0385 |
башталып | 509 | 1 | 25 | 0.0400 |
пайдаланылат | 1200 | 2 | 48 | 0.0417 |
кандайдыр | 724 | 1 | 21 | 0.0476 |
жасалат | 653 | 2 | 41 | 0.0488 |
болот | 11073 | 21 | 416 | 0.0505 |
бөлүнөт | 2190 | 3 | 59 | 0.0508 |
баштады | 237 | 1 | 19 | 0.0526 |
алат | 2297 | 8 | 150 | 0.0533 |
түзөт | 2396 | 6 | 112 | 0.0536 |
өсөт | 886 | 3 | 55 | 0.0545 |
өтөт | 1356 | 4 | 73 | 0.0548 |
берет | 2657 | 7 | 126 | 0.0556 |
In this subsection, we compute the ratio of the number of right neighbors and the number of left neighbors. Again, we look for words with extreme ratios:
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II